Claims 



What is claimed is: 

1. A process of identifying transcription factors such as activators and/or repressors 
comprising: 

providing cells with a nucleic acid sequence at least comprising a sequence CACCT (the first 5 
nucleotides of SEQ ID NO: 1), preferably twice a CACCT sequence (the first 5 
nucleotides of SEQ ID NO: 1), as bait(s) for the screening of a library encoding potential 
transcription factors and 

performing a specificity test to isolate said transcription factors. 

2. A process of identifying transcription factors such as activators and/or repressors 
comprising: 

providing cells with a nucleic acid sequence comprising one of the sequences CACCT-N-CACCT
(SEQ ID NO: 1), CACCT-N-AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT
(SEQ ID NO: 3), or AGGTG-N-AGGTG (SEQ ID NO: 4) as bait wherein N is a spacer
sequence.



3. A process according to claim 1 or claim 2 wherein the transcription factor comprises
separated clusters of zinc fingers.



4. A process according to claim 1, claim 2, or claim 3 wherein the sequence originates from
a promoter region. 



5. A process according to claim 4 wherein the promoter region is selected from the group
consisting of Brachyury, [...] and E-cadherin.



6. [...] the process of [...] claim 3, claim 4, or claim 5.
7. A process for identifying compounds with an interference capability towards
transcription factors as defined in claim 6 by

adding a sample comprising a potential compound to be identified to a test system comprising: 
(i) a nucleotide sequence comprising one of the sequences CACCT-N-CACCT (SEQ ID 
NO: 1), CACCT-N-AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT (SEQ ID NO: 3), or 
AGGTG-N-AGGTG (SEQ ID NO: 4) as bait wherein N is a spacer, and (ii) a protein 
capable of binding said nucleotide sequence,

incubating said sample in said system for a period of time sufficient to permit interaction of the 
compound or its derivative or counterpart thereof with said protein, 

comparing the amount and/or activity of the protein bound to the nucleotide sequence before and 
after said adding and 

identification and optionally isolation and/or purification of the compound. 

8. The process according to claim 7 wherein the protein is a Smad-interacting protein. 

9. The process according to claim 8, wherein said Smad-interacting protein is SIP1.

10. A compound produced by the process of claim 7, claim 8, or claim 9.

11. The compound of claim 10, wherein said compound modifies regulation of E-cadherin
expression by SIP1. 

12. A pharmaceutical composition to prevent tumor invasion and/or metastasis, said
pharmaceutical composition comprising:
the compound of claim 10 or claim 11 in an amount to prevent tumor invasion and/or metastasis
in a subject, and
a pharmaceutical[...]
(i) a nucleotide sequence comprising a sequence selected from the group consisting of CACCT-N-CACCT
(SEQ ID NO: 1), CACCT-N-AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT
(SEQ ID NO: 3), and AGGTG-N-AGGTG (SEQ ID NO: 4) wherein N is a spacer
sequence and

(ii) a protein capable of binding said nucleotide sequence.

14. A test kit to perform the process of claim 2, said test kit comprising: 
a nucleic acid sequence comprising one of the sequences CACCT-N-CACCT (SEQ ID NO: 1), 
CACCT-N-AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT (SEQ ID NO: 3), or AGGTG- 
N-AGGTG (SEQ ID NO: 4), wherein N is a spacer sequence. 

15. A method for detecting an interaction between a first interacting protein and a second
interacting protein comprising:

providing a suitable host cell with a first fusion protein comprising a first interacting protein
fused to a DNA binding domain capable of binding a nucleic acid sequence comprising one
of the sequences CACCT-N-CACCT (SEQ ID NO: 1), CACCT-N-AGGTG (SEQ ID
NO: 2), AGGTG-N-CACCT (SEQ ID NO: 3), or AGGTG-N-AGGTG (SEQ ID NO: 4)
wherein N is a spacer sequence,
providing said suitable host cell with a second fusion protein comprising a second interacting
protein fused to a DNA binding domain capable of binding a nucleic acid sequence
comprising one of the sequences CACCT-N-CACCT (SEQ ID NO: 1), CACCT-N-
AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT (SEQ ID NO: 3), or AGGTG-N-AGGTG
(SEQ ID NO: 4) wherein N is a spacer sequence,
subjecting said host cell to conditions under which the first interacting protein and the second
interacting protein are brought into close proximity and determining whether a detectable
gene present in the host cell and located adjacent to said nucleic acid sequence has been
expressed to a greater degree than in the absence of the interaction between
the first and the second interacting protein.
16. An isolated nucleic acid sequence comprising a sequence selected from the group
consisting of CACCT-N-CACCT (SEQ ID NO: 1), CACCT-N-AGGTG (SEQ ID NO: 2),
AGGTG-N-CACCT (SEQ ID NO: 3), and AGGTG-N-AGGTG (SEQ ID NO: 4) wherein N
is a spacer.

17. A method of identifying a new target gene, said method comprising: 
identifying said new target gene using a nucleic acid sequence, said nucleic acid sequence
comprising a sequence selected from the group consisting of CACCT (the first five
nucleotides of SEQ ID NO: 1), CACCT-N-CACCT (SEQ ID NO: 1), CACCT-N- 
AGGTG (SEQ ID NO: 2), AGGTG-N-CACCT (SEQ ID NO: 3), and AGGTG-N- 
AGGTG (SEQ ID NO: 4) wherein N is a spacer. 
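
As an illustration only, and not part of the claims: AGGTG is the reverse complement of CACCT, so the four bipartite sequences recited above correspond to the four orientation pairs of two CACCT sites separated by a spacer N. A candidate sequence could be scanned for such sites roughly as sketched below. The function name and the spacer length bounds are hypothetical; the claims do not specify a spacer length.

```python
import re

HALF_SITE_LENGTH = 5  # CACCT and AGGTG are both five nucleotides long


def find_bipartite_sites(seq, min_spacer=0, max_spacer=50):
    """Return (start, left_site, spacer_length, right_site) for each pair of
    half-sites whose spacer length lies within the given bounds.

    The default bounds are assumptions made for this sketch only.
    """
    seq = seq.upper()
    # Lookahead so that every occurrence is reported, in left-to-right order.
    positions = [
        (m.start(), m.group(1))
        for m in re.finditer(r"(?=(CACCT|AGGTG))", seq)
    ]
    hits = []
    for i, (left_start, left_site) in enumerate(positions):
        for right_start, right_site in positions[i + 1:]:
            spacer = right_start - (left_start + HALF_SITE_LENGTH)
            if spacer < min_spacer:
                continue
            if spacer > max_spacer:
                break  # later half-sites are even further away
            hits.append((left_start, left_site, spacer, right_site))
    return hits


if __name__ == "__main__":
    # Made-up test sequence, not taken from any promoter.
    for hit in find_bipartite_sites("TTCACCTGGGAGGTGAA"):
        print(hit)
```

Each hit reports which of the four orientation pairs was found and how long the spacer is, so the result can be filtered further once a spacer range of interest is known.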